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We consider the production and detection of a sequential, down type quark via the 
mode pp — )• h'b' — )• W^W~ti — )■ Iv^Sj at the LHC, with the colhsion energy y/s = 10 
TeV and the total integrated luminosity around 1 fb^^. We assume my = m^/ = 600 
GeV. A full reconstruction is employed and the signal and background discrimination 
is studied within a neural network approach. Our results show that this mode can 
make a useful contribution to the b' search. 
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A sequential fourth family is a well-defined extension of the standard model [l, 2] which 
could have important irnplications for our understanding of electroweak symmetry breaking 
and the flavor problem |3|, l^]- In order to confirm or rule out the fourth family at the LHC 
it will be important to be able to test for the existence of both the t' and V . The detection 
of t' could first occur in the lepton plus jets mode, as with the discovery of top quark, via 
t't' — )■ W^W^hh where one W decays leptonically js^. The process h'h' — )■ W^W~qq is 
similar if it is assumed that g is m or c 

lepton plus jets mode faces a considerable combinatoric problem 



61. But when h' dominantly decays toW t then the 
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m 



The search for the t' can benefit from the use of jet invariant masses to identify ly-jets 



10| . since the W^s are more likely to be both boosted and isolated when coming from the 



t't' rather than from tt. Here a relatively large cone size is used in the jet finding algorithm, 
and the ly-jet can be simply combined with another jet to reconstruct the t' mass. This is 
not as successful for the h' since here the W must be combined with a reconstructed t. This 
would require higher mass 6"s (as for the vectorlike quarks in ll|]) where the t's are more 
boosted. 

When two VT's decay leptonically then the same-sign lepton signal becomes available 12|. 
This has a branching fraction almost 6 times smaller than the single lepton mode, but it 
enjoys the advantage that the standard model background is small. The CMS collaboration 
has adopted this channel to explore the discovery potential of h'h' [l3]. However without 
reconstructing all objects in the decay chains it is difficult to distinguish a fourth family 
signature from that of other new physics producing a same-sign lepton signature. For in- 
stance this signature might come from fourth family neutrinos, especially when they have 



Major ana masses 
in SUSY models 
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151] . Same-sign W^s or t's or charginos can appear in final states 



171]. A same-sign top pair can be produced by exchanging a neutral 



scalar or vector boson in t and u channels 



Iq . In fiavor models for neutrino physics a dou- 



bly charged particle can give rise to same-sign leptons |l9l . |20| . An attempt to reconstruct 
the other objects in the events containing same-sign leptons is hampered in the case of h'V 
production in the context of a fourth family. In this case the same-sign leptons must emerge 
from different heavy quarks. In principle the transverse mass of each h' can be reconstructed 
via the MT2 method 2l|] but one must again confront nontrivial combinatorics. 

A comprehensive study of the heavy quarks in vectorlike models has been conducted 
by 0], where signatures with four-, three-, double-, and single-lepton + jets have been 
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studied using the likelihood method. However, the study of the discovery potential for the 



lepton plus jets mode h'h' — )■ W~tW^t — )■ d-vSi is left undone apparently due to the large 
combinatorics. This mode was explored in the ATLAS Design Report where it also does 
not appear to be a useful discovery mode. Here we shall explore whether these difficulties 
may be at least partially overcome by using a full reconstruction method combined with a 
neural network analysis. 

We are considering a complete sequential fourth family of chiral matter fields and we 
choose iriti = my = 600 GeV. It is typically thought that m^/ > m^/ from electroweak 
precision constraints 22|. One of our goals is to separate the effects of the t' from the b' 
signal, and so a larger t' would only make this easier. Thus in this sense our rrit' choice is 
conservative. 600 GeV masses are close to the unitarity upper bound j^sl, and so this is 
also a conservative choice for considering the discovery reach. We assume that the b' decays 
predominantly to t, as might be expected from an extended CKM matrx. It is certainly 
consistent with current bounds inferred from single production of t at Tevatron 25|, |26| and 
from the global fit from precision data and low energy processes 27|, |28|. Assuming that 
branching fraction for b' — )■ Wt is unity, the decay modes and branching fractions from b'b' 
are as displayed in Table (jl]). 
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TABLE I. Branching fractions of decay mode of b'b' are shown. 



We shall attempt to reconstruct all objects in each event and to collectively use both the 
kinematic observables and the observables obtained from reconstructed objects to suppress 
background. Both heavy quarks in each event will be reconstructed. We will only consider 
the dominant backgrounds from ti + nj and W + nj, since other backgrounds can be safely 



neglected 
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24j |. For the effects of QCD corrections we must estimate K factors appropriate 



to the large center of mass energies and large number of jets in events passing our cuts. We 



use K = 1.5 for bot 
W + nj background 



1 the signal events and the tt + nj background, and K = 1 for the 
3-31|. 



We use Madgraph/MadEvent (32| to generate signal events and Alpgen [33| to generate 
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background events. The MLM parton-jet matching method is used where for the tt + nj 
samples, PTmin = 100 GeV, while for the W + nj samples, PTmin = 150 GeV. These values for 
the Alpgen pTmin parameter are appropriate for the large center of mass energies . Alpgen is 
designed so that physical results are quite insensitive to the value of Prmin as long as extreme 
values are not taken. Changing PTmm amounts to a rebinning of the jet multiplicity samples, 
and our choice of PTmin implies that the back grou nd is dominated by the tt + lj, ti+{> 2j), 



W + 2j and W + {> 3j) samples 10|. Pyth ia 34j | is used to simulate shower, fragmentation, 
hadronization and decay processes. PGS [35] is used to simulate the detector effects and 
to find jets, leptons, and missing energy in each event. We modify the PGS code slightly 



so as to adopt the anti-^T algorithm 36| to find jets in the events. For the jet resolution 
parameter we choose R = 0.4. 

We adopt the following preselection rules: 

• Jets are required to have prU) > 20 GeV. 

• There is only one energetic lepton with Pt{^) > 20 GeV {£ = e , fi) and the missing 
energy satisfies 20 GeV. 

• We impose s > 1200 GeV and Ht > 900 GeV. s is the invariant mass obtained from 
the momentum sum over lepton, missing energy, and jets which pass all cuts. Ht is 
the scalar sum of transverse momentum over lepton, missing energy, and jets which 
pass all cuts. 

The selection efficiencies for each of these preselection rules can be found in Table (II). In 
Table (III) we show the jet multiplicity samples for rij = 6, rij = 7, and rij > 8. This shows 
that the samples with rij = 7 and rij > 8 have superior signal to background ratios and 
so we concentrate on those. We do not consider b tagging since it does not help much to 
separate signal from the tt + nj background. 

Our full reconstruction of four W^s, two t's, and two &"s is a follows. 

1) Find the two- fold solutions of the z-component of the neutrino momentum by assuming 
that the lepton and missing energy are from the W leptonic decay. 



2) Loop over all jets and combine 6 jets into 3 hadronic W^s. 
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h'U 


t'i' 


VF+jets 


tt + jets 


Pt{() > 20GcV 
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$> 20GeV 
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28% 
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No. of Events with 1 fb~^ 


66.8 


56.9 


831.1 


536.1 



TABLE II. The selection efficiencies of the preselection rules are shown. Note that we have also 
used Ht cuts (less than 900 GeV) in Alpgen to generate the background events. In the last row, 
we normalize the number of events by assuming the integrated luminosity as 1 fb~^. K factors are 
not included. 
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TABLE III. The data samples of different jet multiplicities are shown. All events pass the prese- 
lection rules given in Table (II). K factors are not included. 

3) Pair four W candidates (three hadronic and one leptonic two of which come from 
t) and two b jet candidates into 2 6"s and evaluate the function, which is defined 
as 



X — Z^i=i ^2 H ^ 



+ Z^i=i ^ \ ^. — 



w 

2 



(1) 



awt. =11 GeV, aw — 14 GeV, at — 20 GeV, and ab' — 25 GeV arise from the 
resolution of the calorimeters detectors where the cr^'s are assumed to be the sum of 
those of the decay products. 



4) For each event, from all possible pairings including the neutrino two-fold ambiguity, 
we choose the one with minimum ^ the right reconstruction of all objects. 
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For the channel Uj > 8 we assume that all W bosons have two jets while for the channel 
rij = 7 we assume that one of the hadronic W bosons can be a single jet. 

To further enhance signal to background it appears that we must resort to multidimen- 
sional variable analysis methods, such as likelihood, boosted decision tree or neural network 
methods. We choose the last of these methods and use the multilayer perceptron method 
which has been implemented in the ROOT package. 

The basic idea of this method in data analysis is to employ the high dimensional feature 
space to better separate signal and background. This method has a long history in particle 



physics [37[, it has been developed into c 
used in top quark precision measurement 



uite a mature form 



and it has been widely 



. Multiple jet final states, for example the full 



hadrouic «-ey... have been suecessfuHy separated fro,„ the huge background by u.ng the 



NN method 
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4l| . where data sample with rij = 6, 7, 8 are considered. It should thus be 
feasible to apply it to a b' search at the LHC 

The discriminating observables can be divided into two groups, as observables obtained 
before and after the reconstruction procedure. The first group includes: 

• the transverse momenta of the leading 4 jets, the lepton and the missing transverse 
energy 

• the leading 4 invariant masses of jets 

• Ht, s, a (aplanarity), S spherity, C (centrality), pf^"^ (the scalar sum of z component 
of momenta) 

• the number of jets with momentum larger than 120 GeV, 60 GeV, 30 GeV, and 20 
GeV, respectively. 

• the first, second, and third minimum invariant mass fn{ji,j2) of pair of jets 

• the first, second, and third minimum R{ji,j2) of pair of jets 

• the (f) angle between lepton and missing energy 



Most of these observables are adopted by the Tevatron groups for the t measurements 
42j. 

The second group of observables are: 



40 
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• Masses and momenta of reconstructed objects, i.e. four 1^'s, two fs, two b jets, and 
two b' heavy quarks. 

• The angle of the b' relative to the z direction in the center of mass frame of the event. 
By combining the charge of lepton with whether the leptonic W is isolated (not from 
top) or nonisolated (from top), we can infer which heavy quark is b'. 

• We also reconstruct all events in terms of a tt production hypothesis. Here the iden- 
tification of objects is based on 

Xtt - 2^i=i ^ ^ 2^i=i ^ ■ l^J 

Then we use the angle between the two t's and the angle between the W and b from 
each t decay as discriminating observables. 

Our neural network has three layers: input layer, hidden layer, and output layer. For the 
rij > 8 sample, we have ended up with 24 input layer and 48 hidden layer observables. For 
the rij = 7 sample, we have ended up with 18 input layer and 36 hidden layer observables. 
The output layer, valued from —1 to 1, discriminates between background and signal. We 
use the default training method (BFGS) and the default parameters of the method encoded 
in the ROOT package. Other training methods in the ROOT package did not yield better 
results. We also find that various subsets of the observables we have chosen can yield 
quite similar results, but we did not find it worthwhile to try to minimize the number of 
observables. 

We would like to compare the application of the NN method to the traditional counting 
method. For the latter we need to find a few most powerful discriminants which can sepa- 
rate the signal and background best and apply cuts to them sequentially, for instance, the 
reconstructed mass bumps oi b', t, and W, the Ht distribution, the sphericity and Pt of the 
leading 3 jets, etc. In our attempt to use this method we were not able to obtain significances 
much greater than unity. This shows how the NN method helps in the optimization of cuts. 
Furthermore, for our expanded set of kinematic observables the final performance of the NN 
method is quite stable and is almost independent of the choice of variables. So in this sense 
the apphcation of the NN method is more straightforward than the careful selection and 
tuning required in the traditional counting method. 



8 



Reconstructed mass of b' before NN | 



signal/background in NN discriminant | 



Reconstructed mass of b' after NN 




200 400 600 800 1000 1200 

mass of b' (GeV) 

Reconstructed mass of b' before NN | 





-0.5 0.5 1 1.5 

NN discriminant distribution 

I signal/bacl<ground in NN discriminant | 



200 400 600 800 1000 1200 

mass of b' (GeV) 



825 

LU 

20 
15 
10 
5 

■i. 



Mb' b' 

- tl+njets 

— w+njets 

"' t'F 





200 400 600 800 1000 

mass of b' (GeV) 



1200 



Reconstructed mass of b' after NN 



|m^.=m,.=600 Ge\^ | 
r 1 


lb' b' 


r 1 

1 A 


- tl+njets 




— w+n jets 


>-^H 1 


,,, ,.p 


i^H 

1 1 
r if 

■ ■ ■ 1 . .£t!fl|TT'l!V 





-1 -0.5 0.5 1 1.5 

NN discriminant distribution 



mass of b' (GeV) 



FIG. 1. The reconstructed mass peak of a 600 GeV mass b' for both signal and background is 
shown for the rij > 8 sample, where we provide both stacked (upper row) and unstacked (lower 
row) plots. The result before applying the neural network is shown in the left column and the 
neural network discriminant is shown in the middle column. With a discriminant cut of 0.15, the 
final result is shown in the right column. The m^' observable has been temporarily removed from 
the neural network to produce these plots. The assumed luminosity is 1 fb~^and there are two 
contributions to the histogram from each event from the two values for nih' ■ K factors are included. 



In Fig. 1 we show the reconstructed mass peak of a h' with a 600 GeV mass, with the 
integrated luminosity 1 fb~^ from the Uj > 8 sample. The left and right columns show 
results without and with the neural network, while the middle shows the discriminant. We 
see that the W + jets background is small, which is largely the result of requiring a large 
number of jets. We find that the neural network discrimination improves S/B hj a factor 
of about 4 for the rij = 7 sample, and 3 for the Uj > 8 sample. And we also see that the b'b' 
signal can be quite effectively isolated from the contribution of t'i' with rrit' = 600 GeV. 

Table IIVI summarizes our results. It also shows the effect of a modified reconstruction 
that makes use of at least one required 6-tag. The result is a lower significance, which is 
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TABLE IV. The number of events, normalized to 1 fb~^, for both signal and background after 
applying neural network, for the Uj = 7 and rij > 8 jet multiplicities, and with and without a 
required 6-tag. We combine the t't' and b'b' events to define the signal S. The effects of the 
normalization uncertainty of the background processes are indicated in the last two columns. K 
factors improve the results but are not included here (see the Summary). 



not surprising since the dominant background from tt + jets also has 6-jets. We used b- 
tagging efficiencies of 0.6, 0.1 and 0.01 for b, c and light quarks respectively. There could be 
other reasons to employ 6-tagging given that it reduces the combinatorics for proper event 
reconstruction. For example it could help to overcome the effects of pile-up in these high jet 
multiplicity events. 

There is a systematic uncertainty on the overall normalization of the backgrounds due to 
our reliance on a Monte Carlo estimate. There are also other systematic uncertainties such 
as those related to the jet energy scale. To estimate the effects of such uncertainties, we 
show in the last two columns of Table [IV] the significance in two cases where the background 
normalization (for both ti and W + jets) is allowed to fiuctuate up by 20% and 10% (a 
method used for example in 43||). Statistical uncertainties will be of lesser importance as 
the integrated luminosity is increased. 



SUMMARY 

We have studied the specific case of rrib' = mf = 600 GeV with = 10 TeV as a 
case study to test the feasibility of a full reconstruction method for the mode pp — ?■ b'b' — >■ 
W^iW~t — )■ iueSj. This appears to be workable even though there are 7 or 8 jets and 
large combinatorics (over 10^). By applying the neural network technique the significance 
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S/\/S + B can reach 4.0. (From Table IV after combining two multiplicity samples we have 
S/ \/ S + B = 3.3 X Vl.S = 4.0 where a/1-5 is from the K factors.) The significance drops to 
2.6 or 3.8 for 20% or 10% upward fluctuations in the background. When a 6-tag is required 
the significance is 3.8. This is for an integrated luminosity 1 fb~^. For ^/s = 7 or 14 TeV 
the significance changes by a factor of about 1/2 and 3 respectively, and so in the case of 7 
TeV this reduction will be overcome by a larger integrated luminosity. 

To address the systematic uncertainties we expect that a more data-driven approach to 
background subtraction will be adopted by experiment ahsts. We would like to point out 
from Fig. (1) that the discriminant distributions (middle plots) can be at least as useful 
as the mass distributions in the development of a subtraction procedure. Improvements in 
our analysis could come from a jet finding algorithm that is optimized for the treatment of 
highly boosted W^s that cannot be resolved as two jets. Information on the substructure of 



these massive jets may be useful or alternatively a continuous jet cone algorithm 4^ might 
be used, where the jet cone size can be smaller than 0.4. 

We can compare the sensitivity of the lepton plus jets mode to the two lepton same- 
sign mode. The latter mode was studied in ^ for the same ^/s, luminosity and masses 
as considered here. With the following restrictions on event selection, Ht > 1 TeV, 2 
isolated same-sign leptons, |;> 50 GeV and M{i^e^) > 100 GeV, the number of expected 



background events is essentially zero 12, ll3ll- The number of signal events was found to 



be 7 corresponding to a significance of 2.6. The opposite sign two lepton mode may also 
useful to consider |7|. These various analyses can be combined to enhance the sensitivity to 
b' while eliminating other new physics explanations of the signal. 

For the case of similar t' and b' masses at or below 600 GeV our results lead us to 
conclude: 1) a helpful search for b'b production at the LHC can be made via the lepton plus 
jets mode; 2) the b' mass bumps can be successfully reconstructed by using a method; 
3) a simple cut on the number of jets is effective at suppressing the t' events (for similar t' 
and b' masses); 4) a neural network analysis is useful for background discrimination; 5) the 
resulting significance could be comparable to that of the same-sign leptonic mode. These 
results are complementary to 0, Q, 24 1 and can help experimentalists to decide how to 



distinguish a b' signal, for example from same-sign leptons, from other new physics. 
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